Cortical free association dynamics: distinct phases of a latching network 
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A Potts associative memory network has been proposed as a simplified model of macroscopic 
cortical dynamics, in which each Potts unit stands for a patch of cortex, which can be activated in 
one of 5 local attractor states. The internal neuronal dynamics of the patch is not described by 
the model, rather it is subsumed into an effective description in terms of graded Potts units, with 
adaptation effects both specific to each attractor state and generic to the patch. If each unit, or 
patch, receives effective (tensor) connections from C other units, the network has been shown to be 
able to store a large number p of global patterns, or network attractors, each with a fraction a of 
the units active, where the critical load pc scales roughly like pc ~ CS^ /ahi{l/a) (if the patterns 
are randomly correlated). Interestingly, after retrieving an externally cued attractor, the network 
can continue jumping, or latching, from attractor to attractor, driven by adaptation effects. The 
occurrence and duration of latching dynamics is found through simulations to depend critically 
on the strength of local attractor states, expressed in the Potts model by a parameter w. Here 
we describe with simulations and then analytically the boundaries between distinct phases of no 
latching, of transient and sustained latching, deriving a phase diagram in the plane w — T, where 
T parametrizes thermal noise effects. Implications for real cortical dynamics are briefly reviewed in 
the conclusions. 
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I. INTRODUCTION 

The Hopfield [l| model and its subsequent analy- 
sis with statistical physics techniques [2| have provided 
an important quantitative theoretical framework to ap- 
proach the operation of associative memory networks in 
the brain. After many years of modeling developments 
and gradual familiarization by the neuroscience commu- 
nity [3|, i4i] , the notion of attractor states forcefully put 
forward by the Hopfield model has guided experimental 
investigations in the rat hippocampus [5|, [g and in the 
primate inferotemporal cortex [TJ, |8|, suggesting analy- 
ses of the time course of attractor dynamics which have 
revealed a striking rhythmically pulsed or "flickering" 
regime Q- 

Attractor dynamics has been proposed to be relevant 
to global cortical function [I^, \lM : also in situations in 
which the cortex actually produces behavior, e.g. speech, 
confabulation, or a drawing, as opposed to just respond- 
ing to a stimulus from outside |134l8| . It appears then 
of particular interest to go, within a modeling frame- 
work, beyond the standard cued retrieval operation, to 
protracted cortical dynamics including attraction to, and 
escape from, a sequence of metastable states, or attrac- 
tor "ruins" [1^, [20|. Firing frequency adaptation is a 
simple and salient feature of neuronal dynamics in the 
cortex, which results in the unavoidable metastability of 
any of its attractor states. The study of a full neuronal 
network model of an extended "cortex" with firing fre- 
quency adaptation is complicated, however, by the ne- 
cessity to treat simultaneously local and global dynam- 



ics 12 ill . The Potts associative memory network model 
|23 . l23fl can be interpreted as a global cortical model, 
in which local dynamics are subsumed into an effective 
description [2J, l2a|- 1*^ storage and retrieval capacity 
(without adaptation) can be studied analytically |26| . 
With adaptation, it shows latching behavior, i.e. a se- 
quence of jumps from one attractor to another (Fig. [ij , 
with non-trivial transition statistics, neither random nor 
deterministic [23, llg . The aim of the present study is 
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Figure 1: (Color online) Examples of latching dynamics in the 
Potts network. The correlation (overlap) of the network state 
with each of a number of stored activity patterns is indicated 
in a different color. The retrieval of a memory can be followed 
by the inactive state (a) or by latching dynamics, that either 
die out after a few steps (b) or self-sustain indefinitely (c), 
depending on the parameters of the network. 



to characterize the conditions for latching dynamics to 
occur, and to self-sustain indefinitely. The model is de- 
fined in the next section, and the various latching regimes 
are described in Sec. IIIII Then in Sec. IIVI an analytical 
approach is developed, that yields a phase diagram for 
the model. Its implications for real cortical dynamics are 
reviewed in the last section. 



II. THE MODEL 

The network considered here is the same as studied 
previously [2J| and we refer to that paper and to Ref. (25| 
for a discussion of the correspondence with real cortical 
networks. Here we dwell on the mathematical description 
of the model, based on the hypothesis that, abstracted 
from observed features of cortical representations |29j . 
it may be relevant to a better understanding of cortical 
dynamics, over time scales of up to a few seconds. 




Figure 2: (Color online) Conceptual derivation of the Potts 
network, obtained as a simplification of a two-tier associa- 
tive memory network by reducing its local dynamics to single 
Potts units, representative of the states of patches of cortex. 



unit j reads |23l . |2f 



A. Potts units 

Let us consider a network comprised of many subnet- 
works interacting with each other with long range connec- 
tions. Each of these small networks has its own attrac- 
tors, with which it tends spontaneously to align during 
the dynamics. So, while in a brief transient condition 
the activation state of each small network may be par- 
tially overlapping with several of its attractors, during 
a protracted quasi-stable memory retrieval condition its 
alignment with one of the attractors tends to be near 
complete. 

To summarily describe these local dynamics we rep- 
resent the small subnetworks with Potts units, each of 
which can be active, to a variable degree from to 1, 
in a number S of distinct active states, representing lo- 
cal attractors, and in one inactive state, when the local 
activity pattern has no correlation with any of the at- 
tractors (Fig. [5]). At any time, the sum of the activity 
levels of unit i, a^ , k £ [0, S], {k = denotes the inactive 
state) is kept fixed to 1, J2k=o '^i ~ ^■ 

The model thus reduces to an autoassociative network 
of N Potts units bound by long range connections. The 
connections, stored in the weight matrix of the network, 
are fixed, with the standard assumption [l| that they 
reflect an earlier learning phase. By setting the weights, 
one defines the global attractors of the system. These 
are configurations of states of Potts units, that represent 
combinations of local attractors in each subnetwork, and 
in turn serve as attractors for the global dynamics of 
the Potts units. In particular we take the network to 
have stored as global attractors p activity patterns ^^ 
(see Sec. IIID|) . with ^ = 1, . . . ,p. The strength J^^' of 
the connection between state k of unit i and state I of 
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where C is the average number of units connected with a 
given unit (denoted as cm in |24| to indicate connectivity 
between Modules) and a is the sparsity parameter, i.e. 
the fraction of units in an active state in each of the stored 
patterns [24| . The connection matrix element Cij is equal 
to 1 if the presynaptic unit (module) j is connected to 
the postsynaptic unit (module) i, and otherwise. The 
connection tensor J^^ can be written more succinctly by 
dropping the last two factors and intending it to act only 
between active states, given that the inactive state of 
each unit, a'^, is not an independent variable because of 

the constraint J2k=o'^i ^ -'^• 

A simple update rule can be assigned to the network in 
discrete time, by randomly selecting each unit at a time, 
evaluating the local field in the direction of its different 
states /i*^ = J2j Jij'^\ (plus any input from outside the 
Potts network), and aligning the unit in the direction 
k* of the strongest field if this surpasses a threshold C/, 
(j^ =1, and setting it in the inactive state, tT° = 1, 
otherwise. The thermodynamics of such a model, for 
has been analysed by [2g, deriving it from the 



Hainiltonian 



H 



N S N S 



(2) 



i.j^i fc,/ — 1 



fc=l 



which can be thought to act under the infiuence of 
noise, modeled by introducing a non-zero temperature 
T = /3^^. The network stabilizes into one of several pos- 
sible asymptotic states, the global attractors, which may 
include clusters of configurations around the quiescent 



state {cr,° = 1} as well as around the memory attractors 
{a^ = 1} (all other states of a unit in the low noise 
regime being at ai ~ 0). For T ~ (3^^ ~ (no ther- 
mal noise at all) the asymptotic states reduce to a single 
configuration, and no further updating occurs. 

We aim here for a more realistic description of local 
cortical dynamics, which occurs in continuous time and 
includes a role for neuronal fatigue, to see the resulting 
effects on global dynamics. 



thermal noise in the local network underlying each Potts 
unit, but the global network is noiseless. See [i^l for 
the analysis of a related model but with different, noisy 
dynamics. 

The inputs are linearly integrated 
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B. Adaptive dynamics 

Transitions among attractors may be induced by a va- 
riety of mechanisms that either weaken temporarily the 
strength of the current attractor or produce an enhanced 
cue in the direction of another one |3(H37| . 

The stability of the stored memories is weakened, in 
the model considered here, by the adaptive thresholds in- 
troduced to model neuronal fatigue and slow inhibition. 
Two types of time-varying thresholds are included: an 
additional contribution to the constant baseline thresh- 
old U, affecting all active states of a Potts unit, 6'°(t), 
with a time constant T3 of hundreds of ms for its dy- 
namics, intended to model various forms of slow, delayed 
inhibition within a cortical patch |38l440| . and a specific 
threshold 0^{t), affecting only Potts state fc, with a time 
constant T2 of tens of ms for its dynamics, intended to 
model resource depletion of the neurons (fatigue proper) 
and of the synapses (short-term depression [4l|) active in 
that state. With these modifications, after the stabiliza- 
tion into a global pattern, all the active Potts units are 
increasingly affected by fatigue and, with two characteris- 
tic time constants, tend to inactivate or to change active 
state, eventually changing the activity configuration of 
the network. Thereby the system is pushed away from 
the basin of attraction it had settled in and, sometimes, 
towards that of a new correlated memory attractor. 

The activity of each unit is determined by a set of spe- 
cific input variables r^{t), which rapidly integrate, with 
time constant ti (meant to be in the ms range), the lo- 
cal fields, i.e. the summed influence h^{t) of presynaptic 
units, subject to the dynamic thresholds 0^{t), while the 
non-specific thresholds 9^ (t) and U can be thought of as 
inputs to the inactive state. In detail, the activation of 
the unit i is set at any time t by the non-linearity 
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for the null state, with /3 an inverse temperature 
parametrizing thermal noise, as in the Hopfield model |l| . 
Note that T = /3^^ > implies a smooth transfer func- 
tion for the Potts units, but no stochasticity in their dy- 
namics. The temperature in other words represents here 
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where it is important to note that we have added a self- 
reinforcement term with coefficient w, to model the non- 
linear convergence towards the more active state, induced 
by the local patch dynamics not explicitly represented in 
the Potts model. The w term is then a key difference be- 
tween the original Potts autoassociative network model, 
as studied by Kanter [22|, and our version, intended as an 
effective model of a two-tier network. The parameter w 
provides a positive feedback, that makes the convergence 
towards a local state a rapid self-regenerative process; it 
can be argued to numerically reflect the ratio of local to 
external afferent inputs, to each neuron in the two-tier 
network [43|. We shall call it the local feedback term. 

The adaptive thresholds follow, with slower dynamics, 
the mean activity of the unit 
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These equations completely define the dynamics of the 
network. 

Note that in the limit r2 , r3 — > 00 the model sheds the 
adaptive character of its dynamics, and genuine attractor 
states are indefinitely stable. 

Stability is also maintained when attractors are exceed- 
ingly "deep", that is, if the local feedback w is strong 
enough to overcome the effects of the rising thresholds. 
Then the thresholds stabilize at the asymptotic values 
6^ — )■ a^ and 0° —?■ ^j. af, and the input variables {r}'s 
satisfy the asymptotic system r'^ = h^ — 0^, where both 
the {h}'s and {0}'s are functions of the {(t}'s, hence of 
the {r}'s through Eqs. [3l Away from those limit cases, 
adaptation tends to destabilize memory attractors. 

The one state that is guaranteed to remain stable (pro- 
vided the constant threshold U takes a positive value) is 
the "global null state", where all active states take low 
values, vanishing exponentially with /?, and <t" ~ 1 (the 
exact values depend on /3 and U). 



C. The slowly adaptive regime 

The presence of adaptive thresholds makes an analyti- 
cal approach to the study of the dynamics quite difficult, 
in general. Were it not for the adaptive thresholds, and 
assuming symmetric connectivity, Cy = Cji, one would 
consider the energy function 
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Due to the moving thresholds, however, no energy or free- 
energy can be defined, to also describe how the thresh- 
olds themselves depend on the activity. Nevertheless, we 
are not so interested in the model in general, but rather 
specifically in the regime in which neuronal dynamics is 
faster than threshold dynamics, both intended at the lo- 
cal population level, i.e. when t^ > T2 ^ ti (the slowly 
adaptive regime). 

In the slowly adaptive regime the attractors, which 
were asymptotically stable without adaptation, gradu- 



ally become unstable. By going to the limit in which 
Ti is much shorter than the slow T2 and Ty, times, one 
can conceptualize dynamics as occurring over completely 
separate time scales. 

In a time of order ri the {cr} configuration relaxes 
towards an attractive metastable state, determined by 
nearly constant values for the thresholds, following a con- 
tinuous time version of the discrete-time updating dy- 
namics mentioned in Sec. Ill Al Having found a metastable 
state, an "attractor ruin" in the language of Tsuda |20| . 
the {cr}'s can be rigidly entrapped there, especially if 
thermal noise if very low, /3 ::!> 1. At this point neu- 
ral dynamics becomes almost stationary, while thresholds 
continue to evolve, albeit slowly. On a time scale r2, and 
with nearly constant {cr}'s, the thresholds change linearly 
until they destabilize the {cr}'s again. In this joint limit 
''"2jT'3 3> Ti and /3 3> 1, then, and again assuming sym- 
metric connectivity, one can describe separately the two 
dynamic modes, each governed by a distinct functional. 

The rapid transient dynamics towards an attractor 
minimize the classical Energy 
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where we have just added the extra local feedback term 
(the second in the first row), and the thresholds appear 
as parameters, in the second row, while the second term 
in the second row follows the formulation proposed for 
quasi-binary graded response units by Hopfield [4J| . This 
last term can be rewritten, considering that rf = U+6^ + 
(1//?) ln(cr*^/(T^) and carrying out the integrals, as 
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which clarifies its nature as an "entropy" term. 

The slow adaptation dynamics minimize instead the 
Adaptation function 
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where now the activation variables appear as parameters. 



Note that if we assume that over certain temporal in- 
tervals the thresholds {6} are roughly constant, we have 
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^ ) because the matrix afiSki 



(il), which has determinant 
A = nLo ^f and trace Tr = ^t, af (1 - af ) can be 
seen by induction to be positive definite; whereas if we 
assume that over other temporal intervals the attractor 
variables {(t}'s are roughly constant, we have instead 
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Therefore whenever the system is falling or jumping into 
a new attractor the dynamics are effectively governed by 



the energy function, while during the protracted peri- 
ods of permanence in the attractor, it is the adaptation 
function to be minimized, while the energy grows again, 
preparing for the next jump, as illustrated in Fig. [31 
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Figure 3: (Color online) (a) Latching transitions; (b) cor- 
responding value of the Energy (red curve) and Adaptation 
function (blue curve). During a transition, on a fast time 
scale, the Energy decrease drives the system to a new attrac- 
tor state. Then, on a slower time scale. Adaptation desta- 
bilizes the attractor, pushing the network towards a new 
transition. As in Fig. [l] time is expressed in sec, assuming 
Ti = 10 msec. 



A set of simulations were conducted, however, consid- 
ering what may be called a rapidly adapting regime, when 
in particular T3 is very short, T3 <C ti < T2, which has 
the computational advantage that rich dynamics unfold 
within limited CPU time. Approximate analytical con- 
siderations derived from those applicable to the slowly 
adapting regime can be extended to this rapidly adapt- 
ing regime, which provides a sort of control case, although 
physiologically less plausible (it would model fast inhibi- 
tion, but not its important slow component [39:j). 



D. Correlated patterns 

With the introduction of the adaptive thresholds, the 
network eventually slips away from any attractor it may 
have retrieved. A free association is said to occur if the 
system then jumps into a new attractor. As shown in 
(45| , such a jump is facilitated if there is a second mem- 
ory pattern correlated with the first one. Indeed, while 
individual units are adapting and slowly changing their 
activation, those that show the same activation in both 
patterns act as a cue for the retrieval of the second mem- 
ory. 

To generate a set of correlated patterns we use the al- 
gorithm presented in |24| . The algorithm is comprised 
of a two-step procedure. In the first step a number of 



mutually uncorrelated vectors, called factors, are estab- 
lished. Each factor can influence the activation of some 
of the units, by "suggesting" a particular state. In the 
second step the competition among these factors deter- 
mines the final activation state of each unit in each of the 
p patterns. 

In particular each factor acts on a distinct, but not ex- 
clusive, subset of Naf units with a strength that varies 
proportionally to the relevance of the factor itself and to 
a random number ranging from to 1, which is nonzero 
with probability Cp/. The relative strength (relevance) 
of the factors decreases exponentially, on average, from 
the most relevant factor downward. The exponent C can 
be varied parametrically from a small value, producing 
nearly equivalent factors, to large ones, C sa 1, resulting 
in the dominant influence of the first few factors. Each 
pattern has therefore a probability Op/ to be influenced 
by a given factor. After all the relevant factors have ex- 
pressed their suggestions on an activity direction among 
the S possible ones, a full activation [a^ = 1) is assigned, 
in the state k most activated by the factors, to the Na 
most activated units (most activated in the selected di- 
rection), while the remaining N{\ — a) ones are set in the 
inactive state. In this way, a set of p patterns is produced 
with exactly N{\ — a) units in the inactive state and Na 
units fully aligned with one of the S active states. 

The algorithm thus produces a set of patterns with 
a certain degree of correlation. The correlation can be 
tuned acting particularly on two parameters: apf and the 
exponent C of the strength. Increasing the proportion of 
factors active in a pattern, apf, the average correlation 
between pairs of patterns increases almost linearly, as 
measured by the averages of the number of units active 
in the same state in both patterns, Nas, or in different 
states, Nad, or both inactive in the pair, A'oo, from the 
reference values taken for randomly generated patterns 
(not shown). The ^ exponent acts instead on the relative 
relevance of each factor. The downstream effect on the 
averages of Nas, Nad and iVoo is non-monotonic, as the 
average correlation increases with C for C, small, but then 
it decreases; and its overall magnitude is limited anyway 
(as shown in FigU]). The effect is more salient on the 
variability of the smaller among these numbers, around 
their averages; in particular the average of Nas is quite 
small (just above 4 units with the parameters of Fig. U), 
and its variability from pair to pair is considerably larger 
than the average. In other words, the algorithm produces 
patterns that overall roughly cover the same portion of 
their high-dimensional pattern space, with similar aver- 
age correlations. But when a limited number of factors 
are relevant local relations are altered, with patterns typi- 
cally having a neighborhood of other patterns with which 
they share many units active in the same state, and re- 
mote patterns with which they have no units in com- 
mon, active in the same state. It is a non-transitive type 
of neighborhood (as the active units that deflne it vary 
from pair to pair) , so it does not imply any clustering in 
pattern hyper-space. 
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Figure 4: (Color online) The generation algorithm produces 
a set of correlated patterns, whose correlation can be var- 
ied with the C, exponent. With Potts units, where each can 
assume a variety of active states, to gauge the correlation be- 
tween different patterns (network configurations with a fixed 
number of active units) one has to take in account the number 
of units shared by the two patterns in the same active (Naa , 
red, bottom) and in the inactive (A^oo, black, top) state; the 
number {Nad, pink, third from top) of units active in two dif- 
ferent states; and that {Nao = Noa, blue, second from top) 
of units active in a configuration and inactive in the other. 
The horizontal lines refer to the reference values of Nas, Nad, 
Nao = Noa and A'oo, expressed as percentages, in the case 
of randomly generated patterns, with apf = 0.3, a = 0.25, 
N = 600, p = 140 and S = 9. The effect of ( on the average 
of these numbers is limited, as shown here in percentages, but 
intermediate values of (^ greatly amplify the variability of Nas 
around its average. 



In [43 we focus especially on the effect of the corre- 
lation between two patterns on the dynamics of a single 
latching transition. Here, instead, we aim to study how 
structural network parameters influence the dynamics of 
the whole latching sequence. Therefore, for the sake of 
simplicity, in most of the analyses we use a set of ran- 
domly generated patterns (apf = and C = 0), leaving 
to a brief section the discussion of how correlations alter 
the latching phase diagram. It is clear, however, that 
having more "neighbors" to any current pattern, with 
non-zero Upf and (., facilitates latching transitions and 
also determines which transitions are likely to occur. 

For random patterns, their average correlation is set by 
the sparsity a and the number of active states S. Indeed, 
two patterns on average share A'^(l — a)^ inactive units 
and they have 27Va(l — a) units active in just one of the 
two patterns. No? / S units active in the same state and 
N{S — V)a} I S units active in a different activation state 
(the reference values in Fig.|31). 



E. Storage capacity 

A first observation, about the correlation between pat- 
terns, is that increasing it strongly affects the network 
retrieval performance, i.e. its storage capacity. 

The capacity of the Potts network to store informa- 
tion, as with any neural network, is limited. For an au- 
toassociative network, in particular, there is typically a 
maximal number pc of patterns, called its storage capac- 
ity, that can be stored without significantly affecting the 
ability of the network itself to retrieve each of them from 
a partial cue [^. Exceeding this number, at the presenta- 
tion of the cue the network, once driven solely by internal 
(recurrent) inputs, reaches a state uncorrelated with any 
of the memory patterns, or only partially correlated with 
the memory to be retrieved or, even, with a "wrong" one. 

In |26| the storage capacity pc is analyzed for a vari- 
ety of Potts autoassociative networks. For each of those, 
the capacity limit is studied as a function of the global 
parameter a, quantifying the sparsity of the memory pat- 
terns and indirectly of the network activity, and of the 
local parameter S, the number of Potts states. 

The network considered here is a graded-response and 
adapting variant of what in that analysis is called a 
"sparse" network (irrespective of activity becoming re- 
ally sparse only in the limit a <C 1). In the further "ther- 
modynamic" limit N,p, P ^ (X), with a = p/N finite, its 
storage capacity is found to scale, for a ^ 1, as 
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for a fully connected network or, for a diluted network as 
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which are closer estimates than the simple pc ~ NS"^ /a 
or Pc ~ CS"^ /a, over a broad range of values of a, as they 
are for standard autoassociative networks \A^ . 

This analytical result is validated by numerical calcu- 
lations and by network simulations, but it only holds for 
the storage of random, uncorrelated patterns. One may 
now consider how an enhanced correlation among patters 
affects the storage capacity. 

In a symmetric recurrent network in which one can 
define a Lyapunov or energy function, the correlation 
among memory patterns influences directly the energy 
depth of the network attractors, and hence its storage 
capacity. In a condition in which the stored patterns are 
few and uncorrelated, during the retrieval of a memory 
the overlap of the state of the system with that particular 
pattern is close to one, and close to zero with the others. 
In this case it has been shown [43| that in the thermody- 
namic limit and for T — >■ the attractor configurations 
of the network exactly match the stored patterns, and 



all have the same depth in energy. With correlated at- 
tractors, instead, when a memory is retrieved there is a 
partial activation of some of the other memories, roughly 
proportional to their correlation with the retrieved one. 
The energy minima then have a distribution in energy 
around the mean value. 

Approaching the capacity limit one arrives to a con- 
dition where the basins of attraction of different memo- 
ries overlap and eventually fracture and coalesce, form- 
ing new "spurious" stable states [48|. The main effect 
of correlations is similar, to generate overlaps among the 
attractors. This makes it more difficult for the network 
to retrieve the stored patterns. 

Fig. [S] shows the difference in p^ for sets of patterns 
with three different correlation levels, as obtained from 
computer simulations. From the figure, one sees that the 
presence of correlations, generated by the multi- factorial 
algorithm, decreases significantly the capacity of the net, 
when a — >■ 0, with respect to the randomly correlated 
case. One notes that, conversely, the effect of correlations 
is negligible for non-sparse patterns, a > 0.5, but in this 
range the actual capacity is sensibly lower than predicted 
by the asymptotic formula. 
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Figure 5: (Color online) Storage capacity of a network oi N = 
600 units with 5 = 5 active states as a function of its sparsity 
a for three sets of stored patterns: a set of randomly generated 
patterns {RC: (^ = 0.0, flp/ = 0.0, top solid curve), one of 
weakly correlated (LC: C, = 10~^,ap/ = 0.4, intermediate) 
and one of highly correlated {HC: ^ = 0.1, Sp/ = 0.4, bottom 
cuve). The dashed curve shows the theoretical value predicted 
by Eq. 1161 in the a ^ 1 limit. In these simulations, U = 0.5, 
w = 0, C = 90 and N = 600. 



elicited by cued retrieval. The initial retrieval is trig- 
gered here by a transient signal in the direction of a mem- 
ory pattern, then the system is left to evolve subject to 
its own dynamics. What one may observe is shown in 
Fig. [TJ After the first cued retrieval the system adapts 
and makes a transition to a new attractor. This new re- 
trieved memory adapts in its turn, and acts as a cue for 
a new transition. Such a behavior is spontaneous and is 
due just to adaptation and to the correlation among the 
patterns. In these simulations, transitions are not due 
to noise since, at the level of interactions among Potts 
units, the dynamics is deterministic and no noise is in- 
cluded in the equations {T represents instead noise inter- 
nal to each patch, in the interactions among the neurons 
subsumed into the effective Potts description). At the 
same time the sequence of retrieved patterns is not de- 
fined ad hoc, and it does vary with any small variation 
of the initial conditions. Finally, one may notice that 
each of the retrieved patterns can appear multiple times 
in the retrieval sequence, as the increase in the thresh- 
olds is just temporary, and each unit can be reactivated 
and/or return to its former activity state, in due time. 

Exploring the space of parameters, a variety of dynam- 
ical behaviors are qualitatively distinguished. For some 
parameters, that we call the finite latching region, the 
complex dynamics, described above as latching transi- 
tions, die spontaneously after a certain number of steps 
and the system relaxes into its inactive configuration, the 
only truly stable one for the network (Fig. HUb)). From 
this region, moving a bit in phase space, i.e. modify- 
ing the values of some of the parameters, it is possible 
to induce an increase or a reduction in the length of the 
latching sequence, and to approach either of two extreme 
conditions: a no latching region, where the system is able 
to recall the cued memory but not to express any asso- 
ciative transition after that (Fig. [Ha)), and an infinite 
latching region, where the latching process self-sustains 
indefinitely in time (Fig.l^c)). 

What makes latching begin, and what makes it even- 
tually stop? We first present a qualitative description of 
the different regimes, or phases, and then analyze their 
boundaries, or phase transitions, in the simplified situ- 
ation in which the stored patterns are uncorrelated or, 
to be exact, just randomly correlated. Structured cor- 
relations are then reintroduced by means of computer 
simulations. 



A. The onset of latching 



III. LATCHING SEQUENCES 

Once randomly or nonrandomly correlated patterns 
are stored, one may study the dynamics of the network 



In [45| we focus on the dynamics of a single latch- 
ing transition, in order to clarify the internal mecha- 
nisms that enable the retrieval of a second pattern and 
to study qualitatively the variety of transitions arising 
from these dynamics. The analysis shows that, apart 
from the somewhat pathological case of oscillations be- 
tween nearly overlapping attractors, latching transitions 
may be distinguished between random ones, dubbed low 
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transitions for the low value of the overlap between the 
latched patterns, and those driven by positive correla- 
tions, high transitions. 
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Figure 6: (Color online) Latching transitions (darker, red 
dots) increase rapidly when the local feedback coefficient in- 
creases from w — 0.35 (left) to to = 0.45 (right). In either 
case, in the Cas — Cad correlation plane, transitions occupy 
a sub-region of that spanned by correlations among all the 
stored attractors (lighter, blue dots): they tend to occur only 
between pairs of patterns with a fraction of active units in 
the same state above the average value Cas ~ a/S (vertical 
line) and a fraction in different states below the average value 
Cad = a{S — l)/5 (horizontal line). In these simulations, 
a = 0.25, S — 7, and the stored patterns were only randomly 
correlated. 



Following this line, we produce p retrieval sequences 
by initially cueing each time a different stored attrac- 
tor. Fig. [5] shows, for those cases in which a transition 
is produced, the correlation parameters Cas and Cad be- 
tween pairs of latched patterns (red dots) with respect 
to all the pairs (blue dots). Cas is the fraction of units, 
among the Na active ones in the first pattern, that are 
also active in the second and in the same state, while 
Cad is the fraction of the same units that are also ac- 
tive, but in a different state; the complementary fraction 
1 — Cas — Cas are not active in the second pattern of the 
pair. One can see that, in agreement with [45|, the tran- 
sitions cluster in a region of the correlation plane. With 
these parameters (in particular, a relatively high thresh- 
old U) , despite the presence of many patterns sharing a 
low value of Cas, no "low" transitions are observed. The 
percentage of patterns that give rise to a latching transi- 
tion clearly increases going deeper into the finite latching 
region (Fig. [n]Jb)). 

Rather than asking whether transitions occur with a 
given correlation between the latched patterns, however, 
we want to focus here on how their occurrence depends 
on the parameters of the network. In order to study 
more systematically the border between the no latching 
and the latching regions, in particular, we vary the num- 
ber of active states S, the number of stored patterns p, 
the network connectivity C and the effective temperature 
T, and extract from simulations the lowest value of the 
local feedback term w that enables latching transitions 
(Fig. [7]). Note that if these structural parameters are 



such that the network straddles the no latching and the 
finite latching region, transitions may occur or not, for 
the same parameters and even for the same set of pat- 
terns, depending e.g. on the initially cued configuration. 
This variability generates the error bars in Fig. [71 
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Figure 7: The minimal value of the local feedback coefficient 
w required to start latching is found with simulations to grow 
with T (sub-linearly) and with S (linearly or slightly supra- 
linearly); and to grow approximately with 'Jcfp. The stor- 
age load (p) thus facilitates latching, whereas local noise, the 
number of states and the connectivity all obstruct it. When 
not varied explicitly, the parameters are set at p = 200, S = 7, 
C = 90, Af = 600 and T = 0.08. 



B. Finite latching 

To better understand how network parameters affect 
sequence length, we fix a set of parameters and let the 
system evolve until it reaches a quiescent condition. If, 
after 6 • 10^ updates of the whole network, equivalent to a 
time 1.8 • 10^ ti, the system is still active, the simulation 
is terminated. 

Fig. [8] shows the average length of the sequences so 
generated, in relation to the variation of several network 
parameters: w, 5, C, T, T2 and T3, while the dependence 
on p is shown in the next Figure. 

Focusing on the parameters defining network archi- 
tecture, it turns out that an increase in memory load 
(Fig. Ufa)), a reduction in the number of states of each 
unit (Fig. [Bfa)) and a reduction in the connectivity 
(Fig- Eld)) all give rise to an increase in sequence length. 
Such increase is also produced reducing the effective tem- 
perature T (Fig.[5I^c)) as well as increasing the local feed- 
back term w (Fig. [5{b)). It is important to note that, for 
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Figure 8: Sequence length (in units of ri) as a function of 
various parameters. The connectivity C, number of attractor 
states S and local noise T are confirmed to obstruct latching, 
whereas larger values of the local feedback w (and storage load 
p, see Fig. [^a)) prolong it. Moreover, the sequence is found 
to be stretched almost in direct proportion to the general 
adaptation time scale rs. When not varied explicitly, the 
parameters are set at p = 200, S = 7, C = 90, N = 600, 
T = 0.09, w = 0.8, n = 3.3, T2 = 100 and T3 = 10^ 



each of these parameters, there is a critical value beyond 
which no latching is observed - and often another value, 
as seen in Fig. ^ beyond which latching appears to pro- 
ceed indefinitely. 

Finally, simulations indicate (Fig. [5I^e, f)) that, while 
the sequence length tends to a plateau by increasing the 
specific adaptation time scale T2, there is a monotonic, 
quasi-linear dependence with the generic threshold adap- 
tation time constant, t^. 



C. Within the finite latching region 

Once the parameter set is fixed, the length of the latch- 
ing sequence is still not entirely defined. There is indeed 



variability due to the initial conditions. As previously 
mentioned, it is enough to slightly jitter the cued con- 
figuration to change completely the sequence of retrieved 
patterns. While considering the dependence on the mem- 
ory load p, we focus then on the distribution of lengths 
seen even for fixed parameters, and due to different initial 
conditions. 

In order to explore the whole configuration landscape, 
after the storage of the memory patterns, we cue the 
network in the direction of each of the p patterns, in 
turn. 

In Fig. [HI one can see the distribution of sequence 
lengths as the network moves from the no latching con- 
dition towards the indefinite latching region. Emerging 
from the no latching phase, as p increases the network be- 
gins to latch, initially with only a few jumps between at- 
tractor states (the mean sequence length is so short that 
it is hard to distinguish it from the x-axis). In this con- 
dition, the variability in sequence length is spread with a 
mode close to zero (the observed gap of 200 time steps is 
approximately the time for the first retrieval to be exter- 
nally triggered) and a roughly exponential tail (Fig. [H^b, 
c)). As the average length increases, however, the dis- 
tribution moves from the exponential to a more normal 
shape (Fig. [Sj^d, e)), suggesting a simple relaxation pro- 
cess with a characteristic time scale, which turns out to 
be just Ta (Fig.[5Jf)). During this relaxation the system 
retains a simple memory of the time spent latching, resid- 
ing in the mean value of the generic adaptation thresh- 
old, 0°. With time constant T3, in fact, 9'^ approaches 
its asymptotic value, eventually terminating the latching 
sequence, when on its way it crosses a certain critical 
value. 

Although so far we have discussed the slowly adaptive 
regime of Sec. Ill CI the complexity seen in Fig. IHfa, b, 
c, d, e) makes it now convenient to review the sequence 
length distribution in the rapidly adaptive regime. As 
explained in Sec. IIIBl in such a regime T3 is a fast time 
constant which can be interpreted as characterizing the 
immediate regulation of the mean activity level locally 
in the cortex, by fast GABA^i inhibitory mechanism. In 
such a regime, as prescribed by Eq. |S1 the threshold 6^ 
imposed on each Potts unit tracks the activation level 

J2k=i '^i °f ^^^ ^'^i^ itself with speed I/T3. Then, by the 
time the next latching transition occurs, the threshold 
almost equals the unit activation itself, and it is removed 
as a distinct variable, thus simplifying the dynamics of 
the system. 

Despite the increase in the sequence lengths, common 
to both regimes, in the rapid adaptation regime we find 
a simpler scenario. For a fixed p the length distribution 
appears quasi-exponential (Fig. IHl^g, h, i, 1)). This trend 
is characteristic of an essentially "memory-less" process, 
in which at each step the probability to fall into the qui- 
escent state, and terminate the sequence, is roughly the 
same. For higher p, (Fig. [Hlji, 1)), one notes the appear- 
ance of potentially "infinite" sequences, although we ter- 
minate them after 3 x 10^ steps, thus making the average 
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Figure 9: Sequence length, in units of n, with respect to the number p of stored patterns when the network is in a slow 
adaptation regime (ri = 3.3, r2 = 100, ra = 10^) (a) and in a fast adaptation regime (n — 20, T2 = 200, ra — 10) (f). (b, c, d, 
e, g, h, i, 1) Distribution of sequence lengths for a fixed p (at the values indicated by the stars). In the slow adaptation regime, 
at latching onset, the distribution of lengths is strongly skewed towards low values, with a tail approaching an exponential or 
Poisson shape (b, c); as p grows and the mean length increases, the distribution is increasingly normal (d, e). The crossover 
appears to occur when the mean length starts to increase strongly, and quasi-linearly, with p. In the fast adaptation regime, 
instead, one portion of the distribution remains quasi-exponential (g, h, i, 1), but the error bars increase for p > 80, due to 
the inclusion in the distribution of an increasing fraction of trials with an undetermined length, because they were terminated 
upon reaching the maximum allowed duration of 3 x 10*ti (the occurrence of such trials is denoted by empty circles; allowing 
for longer duration did not increase the quasi-exponential portion appreciably, given p). 



length finite. For the same sets of patterns, depending on 
the initial condition, hence on the landscape visited by 
the dynamics, finite and infinite (manually terminated) 
latching sequences co-occur. The increase in the average 
length for higher p is then due both to an increase in 
the length of finite sequences and to an increase in the 
proportion of terminated sequences (not shown). 

The quasi-exponential length distribution earlier ob- 
served in the slow adaptation regime close to the no- 
latching region might have been interpreted as resulting 
from the tail of a normal distribution, when its mode 
would be below zero, after its negative component is cut 
off or squashed towards zero. On the other hand, in the 
fast adaptation regime, the quasi-exponential tail is all 
that is seen, because when latching does not stop in a few 
steps, it proceeds indefinitely. But what can terminate 
it in a few steps, in either regime? The large variability 
indicates a process, in which the network sometimes falls 
in a "trap", i.e. in the basin of attraction of one or few 
"isolated" memory patterns, out of which it cannot latch 
further. 

To test this hypothesis, for a fixed set of patterns we 
have run extensive simulations, keeping track of the last 
attractor visited before latching subsides and the network 
falls in the globally quiescent state. 

Fig. [in] shows for each pattern the probability to be 
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Figure 10: Probability of a pattern to be the last of a latching 
sequence (Pstop) vs. its probability to occur at any position 

(P, 

adaptation regimes 



pass) for the fast (left, p = 100) and slow (right, p — 200) 



the last in a latching sequence (Pstop), with respect to its 
probability to occur at any position in a sequence (Pjjass)- 
In the slow adaptation regime of Fig. [101 right the pat- 
terns have a similar probability to occur as the last of a 
sequence. In contrast, in the fast adaptation regime of 
Fig. [ini left there seem to be some "preferred" patterns 
where latching often terminates. Since those patterns 
are not apparently more likely to be visited than others 
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before a sequence ends (the relative Ppass is not higher 
than for other patterns) there must be something in their 
correlational surround that makes them less able to cue 
the retrieval of other attractors, so that dynamics often 
stop there. These considerations suggest that the transi- 
tion to infinite latching occurs when from each basin of 
attraction a "border crossing" is open to another basin 
of attraction that the system can latch to, when slipping 
away of the former. 



D. Sustained latching 



X 10 




ity limit, although somewhat ill-defined in a network of 
moderate size, appears to be crossed before the infinite 
latching region is reached. If this happens, the system 
sustains an indefinite noisy activity, without ever stop- 
ping and without fully retrieving any pattern. For ex- 
ample. Fig. [TT] shows the average sequence length, if not 
terminated at 3 x 10* ti, and the faction of patterns re- 
trieved with an accuracy higher than 90% (a measure of 
retrieval quality) if network dynamics are initiated by a 
partial cue. When S = 7 one can see that the system 
reaches the infinite latching region around p = 50, while 
retrievability decreases beyond p ~ 120. Therefore there 
is a window in the storage load range, in which the system 
really latches from a retrieved memory to the next, in- 
definitely. When S = 3, instead, the ability to retrieve a 
cued pattern starts to decline already around p — 8, when 
the length of the latching process is still finite and in fact 
short. Therefore, for S = 3 and with this set of param- 
eters, the network does not ever really latch indefinitely, 
but rather enters a region of indefinitely protracted noisy 
dynamics. 



E. Rapid adaptation: a percolation type of 
transition 



Figure 11: (Color online) Average sequence length (orange 
ascending curve; left scale, in units of ri) and retrieval qual- 
ity (blue descending curve; right scale) as a function of the 
storage load, for S — 3 (above) and S — 7 (below). 



As introduced in the previous paragraph, moving fur- 
ther in parameter space, beyond the finite latching re- 
gion, one seems to enter a domain of infinite sequences, 
where each latching step is followed by another one, un- 
til the simulation has to be terminated. It is important 
to note that while approaching such infinite latching, the 
network is also moving towards its storage capacity limit. 
For some values of the parameters, indeed, the capac- 



The observations of Sec. IIIICJ suggest that latching dy- 
namics stop when the network runs into an attractor with 
no "border crossings", or links, to other basins of attrac- 
tion. Exploring this hypothesis is, however, complicated 
by the presence of adaptation that continuously modifies 
the basins and their boundaries. 

We consider here the rapidly adapting regime, ra <C 
Ti < T2; in which the generic threshold adapts so 
rapidly that it can be considered as a function quasi- 
instantaneously set to the value imposed by the {el's, 
as if due to rapid inhibition. In this case, the effective 
energy minimized during the rapid transition dynamics 
does not depend on the 6^'s any longer, only the ^'^'s for 
k ^ enter as parameters, and the dependence on the 
(j'^'s is modified by an initial term 
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necessary to take into account the fast 9^ dynamics. tion of 6^ (while 9^ is kept, though hidden as a function 

of the {a-}'s). 
To get an impression of the energy landscape visited 
by the system, without the distortion of adaptation, we Fig. [T^ shows the overlaps of the svstem with the stored 

consider also Eiand, that is. Eg. 1 171 without the contribu- patterns and the two energy values as a function of time 
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Figure 12: (Color online) Latching transitions and relative 
value of the total energy {E, higher, red curve of the bot- 
tom graph) and of the landscape energy [Eiand, lower, purple 
curve). At each attractor transition, the energy decreases 
while Eland has a corresponding peak. 



(expressed m sec, assuming ri = lOmsec). In agreement 
with Sec. IIIBI there is a drop in the energy (red line) 
every time the system has a latching transition, however 
noisy and far from the ideal of allowing only two non- 
zero overlaps the transition may be. At the same time, 
though. Eland) the energy related to the landscape (pur- 
ple line), has a peak. This suggests the presence of an 
energy barrier that has to be passed each time the system 
changes basin of attraction. If this hypothesis is correct, 
we expect the increase in sequence length, approaching 
the infinite latching region, to be due to a decrease in 
such barriers. We run then new simulations, crossing the 
transition to infinite latching by increasing the number 
of stored patterns. 




Figure 13: (Color online) Landscape energy profiles (Eiand) 
for a network with load p = 20 (light blue) and p = 80 (darker 
blue). The increase in the number of stored patterns reduces 
the highest energy boundaries between attractors. 

Fig. [13] shows the energy profile of the system when 
set below its infinite latching transition (light blue) and 
above (blue). Comparing the two curves, no major differ- 
ence is visible between the energy levels of the attractors, 
i.e. the bottom of the valleys. This appears to exclude 
a scenario in which the ability to produce an infinite se- 
quence is due to the presence of deeper minima. On the 
other side, by increasing the number of stored patterns. 



the energy boundaries which separate two attractors de- 
crease, especially the highest ones. This confirms our 
hypothesis and suggests that what actually makes the 
system stop in a particular attractor is the height of the 
(landscape) energy boundaries surrounding it. 

We can then imagine the energy landscape of the sys- 
tem as a high-dimensional manifold, where the presence 
of stored patterns is marked by valleys, that correspond 
to their basins of attraction. 



&^ 




Figure 14: (Color online) 2-dimensional sketches of the energy 
landscape, (a) When just a few attractors are stored, the 
system retrieves memories but it is not able to latch, (b) If 
the number of attractors increases, the basins connect one to 
the other, allowing some latching transitions, (c) A further 
increase in the number of stored patterns leads to a condition 
in which, irrespective of the starting point, the dynamics go 
on indefinitely in time. 



When a few sparse attractors are stored (Fig. [HFa)) 
the network, if cued towards one of those, works correctly 
as a memory but without performing any latching tran- 
sition. The large distance among patterns, indeed, does 
not allow latching to a new attractor, and once exited 
from the first attractor the network remains permanently 
in its stable inactive state (Fig.[Tl^a)). 

If we add new patterns, however, some of their basins 
of attraction connect (Fig.[T4fb)). This lowers the energy 
boundaries enough, allowing the system to make transi- 
tions and creating privileged paths in the landscape. A 
similar condition is visible in Fig. [TJb), where the system 
shows latching dynamics, that remain however limited in 
time. The refractory period due to adaptation, indeed, 
impedes a new immediate recall of the first memory after 
the retrieval of the second one. In order to have consecu- 
tive transitions, an attractor needs then at least two other 
correlated attractors, the previous and the following one. 
The probability of such situation to occur increases with 
the number of stored attractors, and its inherent vari- 
ability produce a consequent variation in the length of 
the latching sequence. 

By adding new valleys we arrive finally to a state in 
which, starting from whatever point in the manifold, the 
system can always find an infinite path across the land- 
scape (Fig.fnrc) and Fig. [IJc)). This behavior is typical 
of a percolation phase transitions [49l. |50|. The increase 
in the probability of a border crossing, which is higher 
the more crowded and the wider are the basins corre- 
sponding to different memories, and which increases the 
average sequence length, can then be regarded as the 
probability of a link in a graph, where the nodes are the 
attractors. Infinite latching might then require, as in a 
percolation phase transition, this probability to be above 
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a certain critical value (leading to the formation of a so 
called giant component). 



IV. AN APPROXIMATE ANALYTICAL 
APPROACH 



as Ja, while the variability is due mainly to the storage 
of the other memories, and to the incomplete connec- 
tivity, Cij 7^ 1, which makes the signal (and noise) vary 
depending on which are the exact presynaptic units. The 
standard deviation of this variability is denoted here as 



In order to understand the emergence of distinct 
phases, let us consider the time derivative of the energy, 
Eq. [TOl when the network is still close to a given attrac- 
tor ^f^ and a single unit is changing state, pretending to 
be in an ideal situation with only clean latching transi- 
tions from a single active attractor to the next. We stay 
now within the slowly adaptive regime. We focus first 
on a unit i which is currently inactive, i.e. ^f = and 
af ~ 1, at the moment just before it starts to increase 
its activation in a particular direction fc, so that a mo- 
ment later daf /dt > 0. Clearly, the change will lead to 
a decrease in energy if dEi/da^ < 0, and in that case it 
is favoured, provided the thresholds can be assumed to 
stay roughly constant. When still at equilibrium, how- 
ever, dEi/da^ ~ (in all S directions), and in fact the 
small activation values a^ ~ of the unit are determined 
by imposing that all such first derivatives vanish. 

Considering that af = 1 — J^i <^\ i the first derivatives 
are calculated as 
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The terms after the first, in the partial derivatives, in- 
clude some (the thresholds) that do not depend on trf, 
some (those proportional to w) that are only linear in af , 
and the entropy terms, proportional to T, that are highly 
nonlinear in cr^', going from — cx) to -|-oo as a'^ goes from 
to 1. The first term, that reflects the coupling to the 
other units, — 'J2jm X];=i Jij'^\^ i^ constant with respect 
to the activation values and thresholds of unit i, but it 
varies a great deal from unit to unit, even among those 
we are focusing on, inactive in the current configuration 
and which are activating if a transition occurs. It can 
be estimated through a signal-to-noise analysis, by con- 
sidering separately in the couplings J^^ the contributions 
due to the storage of the current memory attractor, /i, 
of the attractor after the transition, v, and of all oth- 
ers, Q ^ [i,v. The signal (largely consistent across those 
units) is due to the first two, and its average is denoted 



A. The signal and noise from the network 



The signal may be estimated as 
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where Vas-Xad and V^q measure the actual correlation 
between the current and the forthcoming attractor, rel- 
ative to their expectation values if they were randomly 
correlated. Specifically, they denote the ratios between 
the number of units presynaptic to the given unit that 
are active in the current attractor and are active in the 
same state (Fas), in a different state {Vad) or inactive 
(Fao) in the forthcoming attractor, and their network 
averages, which can be estimated as follows (in the case 
of randomly correlated patterns) 



Fa. = 


= Nasi < Nas > 


Fad = 


= Nad/ < Nad > 


FqO = 


= Nao/ < NaQ > 



<Nas> = Na< Cas >= Na^/S 
<Nad> - Na<Cad>^Na\S-l)/S 
<Nao> = Na<l~Cas- Cad >= Na{l - a] 



(21) 



Therefore on average Fas — Fad = Fao = 1 and the only 
signal comes from the current attractor (the first term 
in Eq. [^0]) . As illustrated in Fig. [51 however, latching 
transitions are favoured among attractors with Fas > 
1 and Farf,Fao < 1, since these values bring down the 
energy derivative. In particular the major contribution is 
due to the Fas term (which is the one that most deviates 
from unity when correlated patterns are generated by 
the multifactorial algorithm, see Fig. [4]). For p random 
patterns, the one with the largest number of shared active 
units in the same state as the current attractor can be 
estimated, from the tail of a Poisson distribution, to have 
an excess number that roughly satisfies the equation 
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(22) 



For example, if Fas — 3 and Fad — Fao — 1, we have 
— Jct ~ —a/S, that is, the excess correlation with the 
forthcoming attractor is sufficient to switch the sign of 
the signal from positive (resisting change) to negative 
(favouring change). 
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The variability across units is due to several sources, 
the most important of which are the presence of the other 
stored patterns, and the incomplete connectivity. Both 
these factors generate terms that are on average zero 
across units, but with a variance that can be estimated 
to be, respectively. 
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which leads to a contribution, taking one standard devi- 
ation in the downward direction favouring the transition 
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B. The single unit terms 

The variability in the coupling term across the S di- 
rections also gives the different activation values 



exp/3(Jcr± A,/-C/ 



(26) 



which are all small if the unit is largely inactive, with 
the signal Ja insufficient to overcome the threshold U. 
Given the exponential dependence on the fluctuations in 
the coupling, however, one activation will typically be 
much larger than the others, which can be taken to hover 
around their mean value ct ~ 0. The stability to fluctu- 
ations that activate the unit further in the direction in 
which it is leaning is determined by the stability matrix 
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When the active states are all activated with the same 
amplitude, the stability matrix has one eigenmode corre- 
sponding to the increase of the common amplitude, which 



is safely stable, and 5' — 1 degenerate eigenmodes cor- 
responding to increasing amplitudes in each direction. 
When instead one state is already more active, and the 
others are close to tr, one of the S — 1 eigenmodes be- 
comes unstable earlier, and it is the one proportional to 
Sik + (Sik — l)e''/S', where we assume an instability in 
the direction 1 and the small corrections e*^ oc a'^ for 
k ^ 1 express the difference between considering the 
full stability matrix and simply the second derivative 
S^ Ei{{aY^}) / {Sajy . The eigenvalue A satisfies the equa- 
tions (dropping the unit index i, and for all fc 7^ 1) 

/3A = - I3w{l -^) + — + — -E 0^ ; + 
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where the equation after the first can be taken to deter- 
mine the {e'^l's as a Taylor expansion in the {cr'^}'s, while 
the first determines the eigenvalue A to zero'th order in 
the expansion, as a function of cr^, and, combined with 
the other equations, the following orders. 

At the instability, A = 0. This determines e'' to be, to 
first order in cr''", 



e'^ ~ cr*^ {(5w + 23/ a") 
and gives the further condition 
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that has to be satisfied together with 5Ei/6<7^ ~ and, in 
principle, 5Ei/5a^ = for A: 7^ 1 to yield o'^,(t'^ and the 
dependence of e.g. w on all other parameters at the phase 
transition. In practice, if one can approximate cr'^ ~ for 
fc 7^ 1, one needs to solve only Eq. [31] and SEi/Sa"^ = 0, 
an approximation which can be improved by inserting a 
small correction term. 



C. Transitions 

It is now possible to interpret the distinct phases in- 
troduced in Sec. Illll with the analytical tools just estab- 
lished. 



15 



The ability of the system to perform or not a transition 
is tightly linked with the possibility for some of the inac- 
tive units to flip in an active state and trigger a cascade, 
leading to an attractor transition. 




Figure 15: (Color online) Partial derivative of the Energy 
Functional with respect to ct*. (a) Example of derivative 
which allows (lower, blue curve) or not (upper, red curve) 
latching transitions, e.g. because of different values of 6'"; (b) 
derivative in the cases of ui = 1.0 (upper, red curve), w — 1.5 
(intermediate, purple curve) and w = 1.9 (lower, blue curve), 
with the other parameters set at S = 9, N = 600, p = 140, 
C = 90, f/ = 0.1, a = 0.25 and /3 = 5. The light green curve 
is the stability equation, Eq. 1271 



In Fig. fTST a) we plot Eq. [1^] with respect to a'^ , for 
two different parameters sets (blue and red curves). In 
the case of the blue curve the energy derivative is nega- 
tive until the unit reaches a complete activation. For the 
red curve, instead, it becomes positive already for small 
values of af. The i unit then, inactive in the retrieved 
attractor, will activate only if the local maximum of the 
energy derivative is under the x-axis, as in the blue case. 
This condition occurs only for some values of the param- 
eters. Eq. [in] depends, indeed, explicitly on w, S, U, 

T and implicitly, through J^j^i J2i=i Jij'^]^ o^^ "' ^^ P^ 
N . The variation of any of them moves and deforms the 
curve, producing the distinct phases presented in Sec. lIIII 
Among all the parameters, we focus now on the w-T 
dependence. 



D. The w-T phase space 

Through computer simulations we explore the w-T 
phase space. As shown in Fig. [T5l b). the qualitative 
effect of w on the energy derivative is, if increased, to 
decrease the local maximum and, much more, the mini- 
mum of the function. 

Fig. [TC] shows that, when changing the value of w, 
the system crosses several possible dynamics scenarios. 
Increasing w, the system is, respectively, able to retrieve 
a cued pattern but not to latch to other attractors {no 
latching phase); capable of a finite number of transitions, 
before getting trapped in the stable silent configuration 
{finite latching phase); capable of an infinite number of 
transitions {infinite latching phase); or trapped in the 
retrieved configuration {stable attractor phase). All this 
phase transitions correspond to critical conditions in the 




Figure 16: (Color online) The w-T phase space, in the slowly 
adapting regime. Dots refer to w values, from simulations, 
that correspond to the beginning of the latching region (red, 
left), of the infinite latching region (purple, middle) and of the 
stable attractors region (blue, right). In these simulations 
S = 7, iV = 600, p = 140, C = 90, a = 0.25 an d U = 
0.1. Solid curves refer to the analytical result (see Sec. lIVE|l . 
while the dotted curve is just a guide for the eyes. Note 
that, in this regime, the m-interval between latching onset 
and the putative percolation transition, to latching sequences 
of duration proportional to rs, is quite short, within the "error 
bars" of the simulation data. 



energy description. 

No Latching - Finite Latching transition. It corre- 
sponds to the marginal condition in which the maximum 
is exactly on the x-axis (Fig.fTTTa)). For lower values of 
w, the derivative is positive and the system cannot latch; 
for higher values, the derivative is negative and the 
transition occurs. Whether the maximum is on the axis 
or not may depend of course on the initial condition, in 
particular on which pattern is retrieved by the external 
cue. If it is an early latcher, so to speak, the network 
may not continue to latch, or it may latch for only a few 
other steps. If instead w is large enough to allow most 
attractors to latch, latching will typically last longer. 
Eventually, all attractors will lead to latching. If this is 
the case, transitions come one after the other, until the 
slow 6'?, originally set at 0, slowly increases (as we are 
focusing now on the slow adaptation regime in which 
T3 > T2 3> Ti), bringing up the whole curve. As soon 
as the maximum reaches the x'-axis, no unit is able to 
flip any longer, and transitions stop. The w-interval 
between early and late latchers is quite short, as seen by 
the standard deviation in the Fig. [TCI 

Finite - Infinite Latching transition. Since the generic 
threshold 0^ can take as maximal value 6*° = 1, beyond 
a certain value of w the growth of 0° is not sufficient to 
ever make the maximum positive. For each value of the 
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temperature T, then, there is a critical value w, beyond 
which latching transitions never stop. 

Infinite latching - Stable Attractors transition. The 
increase of w, however, has the simultaneous effect of 
stabilizing network attractors. Therefore, increasing the 
self-reinforcement term the system reaches a condition 
in which, after the first retrieval, it remains trapped in 
the attractor. Analytically we can treat this last case 
similarly to the previous one, with the only difference 
that the energy derivative, formally identical to Eq. [T91 
is now taken with respect to the activation value of an 
active unit. From Fig. [TTT b). it is possible to see that a 
generic active unit is stable until the Energy derivative 
has its minimum under the x-axis. Then, when the adap- 
tive threshold 0f increases, following the unit activation, 
it translates the curve upwards, inactivating the unit as 
soon as the local minimum crosses 0. If w, however, is 
too large, the translation due to 6^ is not sufficient for 
the curve to cross the x-axis. There is then a critical 
value for w, beyond which all network attractors become 
stable (Fig.ITHa)). 
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Figure 17: (Color online) Partial derivative of the Energy 
Functional, mean value (sold curve) ± standard deviation 
(dotted curve), (a) Critical condition in which some inac- 
tive units (lower, purple curve) are able to flip to an active 
state and guide a transition; (b) critical condition in which 
some of the active units (upper, purple curve) get deactivated, 
thus destabilizing the attractor. 



E. Analytical Curves 

Having set up an analytical framework, we can now try 
to explain the boundaries between the different dynam- 
ical phases obtained through computer simulations, and 
shown in Fig. 1161 

As described in section llVBl at the instability, Eqs.fTOl 
and [3l] should be solved simultaneously, to yield the value 
of a!^ and a relation between the parameters, expressing 
e.g. w as a function of the others. 

Setting all other parameters, and imposing conditions 
appropriate to the different phases, it is then possible, to 
solve the system numerically and find the critical values 
Wc at the phase boundaries. Before analyzing individual 
phase transitions, we can make a general approximation. 



whether unit i is active or inactive and k its future or 
present activation state: we take erf -|- cr° ~ 1. The en- 
tropy term of Eq. [1^] can then be approximated as 
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(32) 



No Latching - Finite Latching transition. The units 
that lead this transition are those inactive in the cur- 
rent attractor, that are activated in the next attractor. 
Among those units, the first to be activated are those 
that were inactive also before the current attractor, e.g. 
in the stable condition in which the system is prepared 
before the external cue that triggers the initial pattern 
retrieval, so that their 0'^ and 9^ thresholds are still at, 
or close to, 0. In the unit-to-unit variability expressed by 
the fluctuations around the mean values in Fig. [T71 these 
units are at the lower end of the range: their activation 
then reduces the signal, i.e. lowers 6Ei/Sa^, for other 
units, which follow suit. To estimate where is the lower 
end of the fluctuation range, one may consider the vari- 
ability expressed by Eq. [25] and the number of units to 
be activated, which are a subset of the Na units active in 
the next attractor -those not active already. We find em- 
pirically that a good approximation to the Wc{T) curve 
derived from the simulations is obtained by inserting a 
coefficient (possibly dependent on N) in front of the s.d. 
of the signal, and applying a correction to the fraction a 
of units to be activated, dependent on w and T, so that 
the mean signal is lowered by a term 
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that we leave for future studies to fully justify. The 
numerical curve thus obtained is in reasonable agreement 
with that derived from simulations, as shown in Fig. 1161 
Note that, except for higher T values, the discrepancy 
between the curve thus obtained and simulation data is 
within the range of variability of the latter, i.e. within 
the difference between the very earliest latching, and the 
putative percolation transition. 

Finite - Infinite Latching transition. The boundary 
between the finite and infinite latching regions is con- 
ceptually of the same nature, except that the phase 
transition occurs when 5Ei/da^ is lowered so much by 
the larger w value as to compensate for the thresholds 6*° 
and O'^, which have grown during the latching sequence. 
Since their final values are difficult to determine, we 
do not attempt an analytical derivation of the phase 
boundary. 

Infinite latching - Stable Attractors transition. The 
transition from unstable to stable attractors is related 
to the inability of any active units to be deactivated by 
the increase in the adaptive threshold. If a few units go 
to the inactive state, others follow, with the same chain 
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reaction process seen above. Therefore the condition for 
attractors to be stable is that no unit should be able to 
inactivate. The limit hence correspond to the condition 
in which, with maximal threshold, and 0^ = 9^ = 1, 
the minimum of Eq. [12] lies on the x-axis. Again, one 
estimates the height of the 5Ei/5a^ curve for the early 
deactivating units from the amplitude of the fluctuations 
in Eq. 1251 The result of this approximation is the blue 
curve in Fig. 1161 To produce this curve, we used directly 
mean 9 values resulting from the simulations. 



F. Adapting the analysis to the fast adaptation 
regime 

The above analysis can be extended to the situation 
in which the specific thresholds 9^ continue to evolve on 
a much slower time scale than the activation values cr, , 
but the generic thresholds 6*^ evolve faster, and the limit 
can be considered in which they adapt almost instanta- 
neously to the activation values. Strictly in that limit, 
it was shown earlier that one can consider the modified 
energy functional of Eq. [T71 where the 0° have been inte- 
grated out, and there is an additional term (^^, erf )^/2 
for each unit, to account for their rapid adaptation. A 
pre-factor /i < 1 may also be inserted before the addi- 
tional term, to take into effective consideration a non- 
instantaneous adaptation of the generic thresholds, i.e. 
when Ta < ri, but still the two time scales are compa- 
rable. It is simpler, though, to restrict ourselves to the 
Umit Ts <C Ti <C T2 . 

In this limit, the partial derivative of the energy func- 
tional with respect to any of its activation values remains 
numerically the same as in Eq. 1191 with the difference 
that the simple term 9^ now stands for ^^ erf. As a con- 
sequence, the second derivatives that enter the stability 
matrix change, and they all acquire an additional term 
equal to 1 . The components of the unstable eigenvector 
in the non-preferred directions are modified into 

e'^~cr''^(^w-f/3S' + 25/(7°) +0(^2); (34) 

and, importantly, the stability condition now reads, to 
leading order. 
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The additional /? term in Eq. [3S] implies that the green 
curve in Fig. [15] is much higher, hence it takes a higher 
value of the local feedback coefficient w to reach a phase 
boundary. The phase diagram of Fig. [16] is basically 
shifted to the right. Moreover, in the fast adaption 
regime there is essentially no finite latching region: it 
is squeezed into a tiny transition strip between the onset 
of latching, when starting in a favorable initial attrac- 
tor, and the percolation-type transition to infinite latch- 
ing, when latching proceeds from any attractor. These 
analytical expectations are well matched by simulation 



results (not shown), although a quantitative agreement, 
when e.g. T3 = 0.3 • Ti does not really vanish, requires 
inserting a pre-factor to multiply the new term, as indi- 
cated above, e.g. /i ~ 0.8. 



G. Latching with correlated patterns 

Whereas to treat the fast adaptation regime one has 
to modify the single unit terms in Eq. 1191 to describe 
the phase diagram appropriate to the storage of corre- 
lated attractors one has to consider an enhanced vari- 
ability, due to correlations, in the local field from the 
rest of the network, i.e. a larger standard deviation Ajo-. 
Correlations produce more marked differences between 
"preferred" and "disfavoured" patterns, in their ability 
to lead to a latching transition; the distance between the 
dotted standard deviation curves in Fig.[T7]is larger, and 
consequently latching tends to start off earlier, as a func- 
tion of an increasing w, and to self-sustain, and eventu- 
ally to be blocked by attractor stability, later. We do not 
attempt here an analytical estimate of the A j^- produced 
by our algorithm for generating correlated patterns, but 
simulations show, in Fig. 1181 that the boundaries in phase 
space are modified as expected. The extent of the latch- 
ing region is therefore amplified, already in the situa- 
tion shown, with only weakly correlated patterns. With 
stronger correlation, the finite latching phase becomes 
even wider, but to consider the functionality of the as- 
sociative network one has to remember that retrieval ca- 
pacity can be seriously impaired, as indicated in Fig. [S] 
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Figure 18: (Color online) The w-T phase space for a cor- 
related set of patterns. Similarly to Fig. 1161 dots refer to 
Wc values dividing the different regions of latching dynam- 
ics. Empty dots, the same as in Fig. 1161 refer to a randomly 
generated set of patterns, full dots to a weakly correlated set 
(C = 10-^ap/=0.4). 
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V. IMPLICATIONS AND FURTHER WORK 

We study here how the introduction of an adaptive el- 
ement affects the dynamics of an autoassociative neural 
network. The model draws a sketch of the cortex as a net- 
work of Potts units, each of them representing a cortical 
patch, and interacting with each other through recurrent 
connections (Sec. Ill A)) . The stability of the stored mem- 
ories, which are encoded in the connections between the 
units, is weakened by adaptive thresholds, which oppose 
the permanence in the current attractor. In this way, the 
network may be pushed out of the basin of attraction 
of the retrieved memory, and then it can slide towards 
a new one, performing what we have named a latching 
transition (Sec. Ill B|) . 

The introduction of adaptation is found to lead to a 
variety of phases of network dynamics. Depending on its 
parameters, the network is either unable to perform any 
transition, remaining inactive after the retrieval of a cued 
memory; or it does go through some transitions, jumping 
from memory to memory in a sequential retrieval chain 
that terminates in the inactive state; or through endless 
transitions, when the latching process self sustains in- 
definitely in time (Sec. IIII[) . In particular, each of these 
phases can be reached by changing the number of local 
attractors assumed to exist in each cortical patch (S*), 
the number of memories stored in the global network 
(p); the connectivity between cortical patches (C), the 
local feedback (w) or the noise internal to each patch 

(T) rsec lmcl) . 

The infinite latching region is however found not to be 
always accessible, and its extension in parameter phase 
space to be restricted by the presence of two other dy- 
namics regions. The increase in memory load of the 
network; which would favour a sustained latching chain, 
also impairs the ability of the system to retrieve indi- 
vidual memories. The so-called storage capacity (indeed, 
retrieval capacity) limit can be crossed either after the in- 
definite latching region is reached, and then it serves as 
its upper boundary, or before, and then it prevents the 
system from ever entering such a phase (Sec. IIIIP]) . On 
the other hand, attractor stability can also be enhanced 
by those same manipulations that favour latching. As we 
saw in section IIVDI an increase in the feedback param- 
eter, w, leads to a condition in which attractors are so 
reinforced, as to become stable. This sets an alternative 
phase boundary to the latching region. 

The introduction of adaptive elements brings a com- 
plication in the analytical description. The effective field 
imposed on a unit comes to depend on the time spent 
by the unit itself in that particular activation state (and 
in preceding states, as well). As a consequence, it is 
impossible to write down a function of state governing 
the dynamics of the system. To bypass this problem, 
we largely restrict ourselves to a slow adaptation regime 
and we treat separately the fast attractive from the slow 
adaptive dynamics (Sec. Ill B]) . The time scale separation 
allow us to focus on the specific instant when attrac- 



tors begin to switch, and to derive analytically an ap- 
proximate phase boundary between the no latching and 
latching conditions and the indefinite latching and stable 
attractor conditions, in the w-T phase space (Sec lIVE| . 
The analytical treatment can also be adjusted to fit a 
type of fast adaptation regime, intended to model rapid 
local inhibition. 

In this somewhat complex scenario, we have finally 
considered how correlations among patterns may affect 
network dynamics. With a two-step algorithm, we gen- 
erate sets of patterns with different distributions of pair- 
wise correlation (Sec. Ill Dp . Stronger correlation is found 
to decrease the storage capacity of the network, and to 
shrink their basins of attraction (Sec. HIE)) . On the other 
hand the very same destabilization brought about by cor- 
relation makes the system more prone to latching tran- 
sitions. We sketch the w-T phase space in the case of 
weakly correlated patterns, and we see that enhanced 
correlation widens the latching phase, by both lowering 
the value of w at the no latching-latching boundary and 
increasing the value needed to stabilize attractors. 



A. Applications to analyses of cortical processing 

Despite its high level of abstraction, the model can 
serve as a mathematically well-defined benchmark, where 
to test general hypotheses about cortical dynamics. An 
obvious example is any relation hypothesised to exist 
between the various parameters mentioned above and 
the duration and characteristics of structured, sponta- 
neous, non-stimulus-driven cortical activity. Experimen- 
tal paradigms have been used to explore such activity, 
even in non-human primates, e.g. in the free drawing 
task developed by Moshe Abeles and coworkers [51| • The 
behavioural model and the mathematical model can be 
brought to bear onto each other. Further work is needed, 
however, to understand how to experimentally manipu- 
late and control any of the parameters, in particular the 
non-structural ones, including the storage load p, or the 
local noise level T, or the relative degree of local feed- 
back w, so as to investigate any change induced in the 
dynamics. 

Another issue that can be probed with the model is the 
relation between the notions of a global workspace |52| 
and of serial vs. parallel processing [53|- At the quali- 
tative level of information processing models, the avail- 
ability of a global workspace can be taken to imply, ir- 
respective of the specialization of individual components 
|54| that one and only one information processing oper- 
ation can occur within it at any given time. Hence, a 
single interconnected network may appear to force serial 
processing: the lack of segregating boundaries, instead 
of being a resource, is turned into a nuisance; it imposes 
a bottleneck. Items that have been processed in paral- 
lel along separate sensory pathways can, on their way to 
distinct motor pathways, get stuck, so to speak, through 
a router, i.e. a distinct piece of cortical machinery with 
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ad hoc properties |55| . which, like any centralised bureau- 
crat, acts as a bottleneck |56| . 

The Potts model, in contrast, with its generic autoas- 
sociative architecture, indicates that lack of anatomical 
segregation is not necessarily equivalent to serial process- 
ing. Processing is effectively serial, in the model, when 
latching dynamics is particularly clean, involving a sin- 
gle global attractor at a time, a situation which does 
occur in certain regions of parameter space. In other 
regions, however, dynamics is messier, with several at- 
tractors simultaneously active, and latching transitions 
that involve only part of a structurally complex, but non- 
segregated network. A rapid transition from attractor A 
to D may occur while attr actors B and C are also active, 
over units largely distinct from those active in either A 
or D. Can the system in this case be argued to operate 
along three separate streams? Hardly so, because the 
streams are not segregated, and their support basis, i.e. 
the corresponding subset of active Potts units, changes 
all the time. The distinction between serial and parallel 
therefore ceases to be as clear cut as it normally is in en- 
gineering systems, and it becomes a quantitative rather 
than qualitative issue. The whole network can be re- 
garded as a single interactive workspace for multifarious 
parallel processes, like a modern urban society. These as- 
pects require a dedicated study, that adapts information 
theoretical measures to assess the dependence between 
successive attractor states |27| . 

A third domain of applicability of the model is in 
studying the relation between local and global attrac- 
tors. In the Potts model, local attractor states are in- 
corporated in the definition itself, which is regarded as 
a reduced or effective theory for a full-fledged neuronal 
model [2^. The search for experimental evidence for such 
states has had clearer successes in the hippocampus [a Hi 
than in higher visual areas of the cortex 0, §• Never- 
theless, data compatible with attractor-like behavior is 
appearing in a variety of preparations, analysed with dif- 
ferent approaches, from Hidden Markov Models applied 
to monkey frontal cortex [57| and to the rat gustatory 
system [58| to kernel methods applied to multi-unit ac- 
tivity from rat anterior cingulate cortex [53 ■ A recurrent 
question is whether such signatures reflect sequences of 
global attractors states, as have been hypothesized to 
underly also perceptual decisions in humans, as discrete 
moments of thought [60| . A full mechanistic understand- 
ing of such phenomenology seems to require multi-level 
neuronal network models, where both local and global 
scales (and possibly intermediate ones) are represented 
explicitly. The complexity of such model, however, is 
daunting, so that the Potts model can provide a more 
tractable approach. 



B. Evolutionary perspective 

An evolutionary perspective [6l| can make use of the 
Potts model to explore the hypothesis [2J| that a criti- 



cal step leading to human forms of cognition is a phase 
transition, incurred as a result of a quantitative change 
in some of the parameters characterizing the primate (or 
indeed mammalian) cortex. In this hypothesis, no ma- 
jor structural change differentiates human cortical net- 
works from those of at least fellow great apes, but a 
qualitatively different functionality of the very same net- 
works may have arisen out of an increase in size, or in 
connectivity, that has induced a phase transition. The 
Potts model offers several candidate transitions: from no 
latching to finite latching, from finite to infinite latch- 
ing, as well as the percolation-type of transition that, 
in the slowly adapting regime, follows closely the onset 
of latching. Identifying the most appropriate scenario is 
made somewhat complicated by the fact that a plausible 
cortical model would presumably include both rapidly 
adapting and slowly adapting thresholds [25|, and the 
two opposite limit cases would have, at the very least, to 
be combined. This is left for further work. Nevertheless, 
the notion that spontaneous cortical activity can proceed 
longer in a larger or more connected brain (not in a scal- 
ing relation, but dramatically longer, as after a phase 
transition) resonates with the notion of infinite recursiv- 
ity, pu t forward in linguistics [62| and in other domains 
|63l - [65| . as we have discussed elsewhere [6a |. 

In this view, the parameters that may constrain cor- 
tical functionality are those that are kept at relatively 
low values, in non-human species, for structural reasons. 
Indeed, some of the parameters in the Potts model can 
be thought to reflect quantities that can be freely set at 
optimal values, and these may include the relative lo- 
cal feedback strength w, the effective noise level T, or 
the adaptation time scales [67|. Other parameters in the 
model, including the number of patches A'^, their connec- 
tivity C, the number of attractor states in each patch, 
S, or the storage load p, reflect cortical quantities that 
it may be more difficult to act upon, either because in- 
creasing their values requires reformulating or extending 
developmental programmes, or because of their mutual 
relationships. Thus, for example, increasing the memory 
storage load is always possible, but it is of little use if, as 
a result, the system becomes overloaded, beyond its stor- 
age capacity, set mainly by the connectivity and by the 
density of local attractor states. Therefore an increase 
in p may only be enabled by a concurrent increase in C 
and/or S [eg. 

It would be desirable, then, to consider the phase- 
transition-to-human-cognition hypothesis by studying 
the phase diagram of the Potts model in relation to 
the parameters C,N,S,p. Unlike w and T, which af- 
fect mainly the single-unit terms, these parameters enter 
largely (except for a minor feedback role for S) in the 
signal conveyed by the rest of the network, and in partic- 
ular they determine its variability Aj^. The functional 
dependence of Aj„ on C, N, S and p is however quite dif- 
ferent in different models of the pattern generation pro- 
cess. It can be estimated analytically for randomly cor- 
related patterns, and numerically for patterns generated 
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by our multi-factorial process, but other models for this 
key process may be proposed that would lead to rather 
different effects on the variability of the signal arriving 
at individual cortical patches, and hence on the phase 
boundaries of the system. This challenging issue is left 
for future investigations. 
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